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Abstract. For a natural extension of the circular unitary ensemble of order 
n, we study as n — > oo the asymptotic behavior of the sequence of orthogonal 
polynomials with respect to the spectral measure. The last term of this se- 
quence is the characteristic polynomial. After taking logarithm and rescaling, 
we obtain a process indexed by t £ [0,1]. We show that it converges to a 
deterministic limit, and we describe the fluctuations and the large deviations. 



1. Introduction 

In this paper, we study the asymptotic behavior of a triangular array of complex 
random variables originating from Random Matrix Theory, and more specifically 
from some models of unitary ensembles. Our results can be viewed as an extension 
of some earlier works on the characteristic polynomial of unitary matrices. Indeed, 
for fixed n, our random variables consist of the sequence of monic orthogonal poly- 
nomials with respect to a random measure associated with a random unitary matrix 
of size n, when evaluated at the point 2 = 1, the last term being the characteristic 
polynomial of that matrix. 

It was already noticed in [3] that the characteristic polynomial of a random 
unitary matrix sampled from the Haar measure on U(n), the unitary group of 
order n has the same law as a product of independent random variables. In a 
previous paper ([5]) we saw that this characteristic polynomial $„(z) (evaluated at 
z = 1) is actually a product of variables 

n-l 

*«(i)= n^-Tj) 

and we named the independent variables jj the modified Verblunsky coefficients, 
referring to the coefficients involved in the Schur recursion for orthogonal polyno- 
mials on the unit circle (OPUC), as dubbed by Simon [25] ■ These variables (for 
j < n) have an explicit density in the open unit disc, depending on their rank j. 
We proved in [5] that this description stays valid when we change the probability 
measure on U(n), by considering the Circular Jacobi Ensemble, that we will define 
below. The construction of the deformed Verblunsky coefficients uses the whole 
sequence {$k,n(z), k = 0, . . . , n} of monic orthogonal polynomials with respect to 
the spectral measure of the pair (U, ex) where e\ is a fixed vector. We have ([5] 
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Prop. 2.2) 

fc-1 

^,„(i)=n( i -7i) 

It is then natural to study the triangular array {$fe !n (l),fe = 0,...,n} and in 
order to normalize the time for different values of n, one can study the process 
{&[nt\,n{l)i t € [0, 1]}, where |_ J denotes the integer part, and where, by convention, 
*o,„(l) = l. 

One can note that <fr n ,n is the characteristic polynomial of U and that when 
U is the chosen according to the Haar measure, the sequence of random variables 
( < I ) n,n(l)) has played a crucial role in the recent interactions between random matrix 
theory and number theory (see [2]). It is also worthwhile to note that in [17], Killip 
and Stoiciu considered also a stochastic process indexed by t = k/n and related 
to the sequence of orthogonal polynomials in the C/3E ensemble. In fact they 
considered variables which are the complex conjugate of our deformed Verblunsky 
coefficients as auxiliary variables in the study of the Priifer phase (Lemma 2.1 in 
[17]). This tool is used again in [15]. Later Ryckman ([53] Section 4) used a version 
of the deformed Verblunsky coefficients in the proof of its joint asymptotic laws. 

To be more precise and to explain the interest of our approach, we now describe 
our model. For n > 1, j3 > and 6 £ C such that D\c S > —1/2, we consider a 
distribution Cjl n ] on the set of probability measures on the unit circle T, supported 
by n points. This family of distribution generalizes the Circular Jacobi Ensemble 
(the notation CJ comes from this fact), and it can be defined as follows. If the 
random measure 

n 

for 6j £ [0, 2if), has the distribution Cjl n ], then: 

• The joint density hgl of the law of (0%, . . . , n ), with respect to the Lebesgue 
measure on [0,27r) n , is given by 

n 

(1.1) /4>r, . . . A) = cg|A(e»* . . . , e^)f flfl - e -^) s (l - e^f, 

where > is a normalization constant. 

• The weights (tti, . . . , n n ) follow a Dirichlet law of parameter /?' on the sim- 
plex 7Ti + • • • + 7r„ = 1, TTj > 0, where ft' := [3/2 (we conserve this notation 
f3' in all the sequel of the paper). 

• The tuples (6i, . . . , 9 n ) and (tti, . . . , 7r„) are independent. 

Besides, when U is an unitary matrix and e\ a cyclic vector for U, then the spectral 
measure \i of the pair [U, e±) is defined as the unique (probability) measure on T 
such that 

(ei,U j ei) = / z j n{dz) jeZ. 

When S = 0, it was proved in [16] that the distribution of the spectral measure 
of the pair ({/, ei) where U is randomly sampled from U(n) according to the Haar 
measure and e\ is a fixed vector of C™, for instance (1,0,..., 0), is precisely CJ^q. 
For <5 = and /3 > 0, these authors found a model of random unitary matrices 
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such that the spectral measure has the distribution Cjjj g. In [5], we gave a model 
corresponding to the case 6^0. All these constructions rely on the theory of 
orthogonal polynomials on the unit circle (OPUC) that we recall now. 

From the linearly independent family of monomials {1, z, z 2 , . . . , z™ -1 } in L 2 (T, jj,), 
we construct an orthogonal basis $o,nj • • • > ^n-i,n of monic polynomials by the 
Gram-Schmidt procedure. The n th degree polynomial obtained this way is 

n 

= = I]> - e ^), 

i.e. the characteristic polynomial of U. The $fc's (k = 0, . . . , n) obey the Szego (or 
Schur) recursion relation: 

where 



The coefhcients a 3 (j > 0) are called Schur or Verblunsky coefficients and satisfy 
the condition a , • • • , a„_ 2 € D := {z G C : |z| < 1} and a„_i € T. There is a 
bijection between this set of coefficients and the set of spectral probability measures 
v (Verblunsky's theorem). We can write the orthogonal polynomials with the help 
of a new system of functions built from the Verblunsky coefficients. Setting 

w«=*--*^ = ^i^). (* = 0,...,n-l) 1 

we have j/o( z ) = an d the following decomposition: 

k-l 

$k,n(z) = Y[(z ~ Vj(z)) , fc=l, ...,n. 
If 7j := we get 

(1.2) * fcl „(l) = JJ(1- 7J ) S fc = l,...n 
and in particular 

n-l 

(1.3) det(J-E0 = $ n (l) = $ n , n (l) = IJ(l-7 i ). 

j'=o 

Note that the definition implies | 7 fc| = |afc|, and in particular | 7n _i| = 1. In the 
sequel, following [5] , we refer to the 7 j 's as the deformed Verblunsky coefficients. 

In [5], it is proven that for fi following the distribution CJ^ n ], the coefficients 
( 7 j)o<j<n-i are independent (note that in general it is not true for the classical 
Verblunsky coefficients, except if S = 0) and their distributions are explicitly com- 
putable. More precisely, for r > and 6 G C such that r + 2*He S + 1 > 0, let 
be the density on the unit disc D proportional to 

(i- NT" 1 (i-^(i-^ 

and let be the density on the unit circle U proportional to 

(l-zf(l-z) s . 
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Then, for j < n— 1, jj has density gp)i n _j_ x \ and 7„_i has density \( s \ Note that 
the law of 7 n _i does not depend on n and /3. The values of the normalization con- 
stants can be easily deduced from the computation of integrals which are collected 
in the appendix of the present paper. 

Let us now explain why we choose to focus on the value at 1 of the character- 
istic polynomial (in addition to its major role in the number theory connections 
mentioned earlier). Note that in the case 6 = (which corresponds to the classi- 
cal Circular Jacobi Ensemble, and in particular the Circular Unitary Ensemble for 
j3 = 2), the law of does not depend on z £ T, since the distribution of the 

eigenvalues of U is invariant by rotation. On the other hand, for 6 ^ 0, the point 1 
plays an important role since it is a singularity of the potential. It is then classical 
to study the behavior of $ n (l) in the large n asymptotics (see for example [14], 
[12] , |15| . [13]). Notice that all these authors consider the case 6 = 0. 

Our extension to a study of the array (1), k < n} has its own interest as a 
study of characteristic polynomials. It comes from the following remark. From a 
measure /i carried by n points, one can also define a n x n unitary matrix U n , called 
GGT by Simon [26] section 10, and which is the matrix of the linear application h 
on I/ 2 (T, /z) given by h(f)(z) = zf(z), taken in a basis of orthonormal polynomials 
with respect to fi. If 1 < k < n, one can denote by Gk(U n ) the k x k topleft 
submatrix of U (which is not unitary in general) . Then it is known (see for example 
Forrester [115] Prop. 2.8.2, or Simon [25] proof of Prop. 3.1) that one has for all 
fce{l,2,...,n}: 

(1.4) $ fc , n (l) = det(/ fc - G k (U n )) 

For other aspects of this model see [6] • 

Most of the results we will obtain on the process {$L„ t j in (l), t £ [0, 1]} will in 
fact concern its logarithm. Note that even when (1) ^ 0, its complex logarithm 
is not obvious to define rigorously, since its imaginary part is a priori given only 
modulo 2n. However, there is a natural way to deal with this issue, which is 
described in the appendix. One can then fully justify the following formula: 

fe-i 

(1-5) log$ fei „(l) = ^log(l- 7j ), 

3=0 

when ^^^(l) ^ 1, which occurs almost surely under Cjjj™]. 

We study asymptotic properties of this determinant as n — > oo under essentially 
two regimes: 

• First regime: 6 is fixed and Die 6 > — 1/2 (hence, this regime includes the case 
6 = 0). 

• Second regime: 6 = /3'dn with $He d > 0. 

Some of the results proved here were already announced in [4]. When (3 = 1,2, 4, 
the independence of the random variables ^k+i.n/^k.n k = 0, ... ,n — 1 and the 
identification of their distributions are strongly related to the results of Neretin 
( [TS] Corollary 2.1). In that framework, they can be carried easily on models of 
matrix balls via his Proposition 2.3. Actually our results may be extended to all 
models where a remarkable separation of variables occur (see [H]). More precisely 
the paper is organized as follows. 

In Section 2 we study the variables log <I> J ,n (1) f° r both regimes and prove 
that their expectations converge to some explicit deterministic function of t, and 
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then we study the fluctuations of (log $[„tj !n (l)) as a stochastic process on the 
space of cadlag Revalued functions. It appears that in the first regime, one has 
to distinguish between the case < t < 1 and t = 1; at t = 1 some transition is 
occuring since the normalization is changed. Indeed, anticipating the notation of 
next section where we write log $[„ t j in (l) — Elog $|„ t j^(l) = £ n {t) + 'uin(t) and 

( n (t) = ), we shall prove that in the first regime, {( n (t); t < 1} converges 



to some explicit Gaussian diffusion {Cd(*)> t < 1} while ( n (l)/^/n converges to a 
Gaussian random variable which is independent of the diffusion {Cd(^)i t < !}• I n 
the second regime, {( n (t); < t < 1} converges to some explicit Gaussian diffusion 
and there is no normalization to perform. 

Section 3 is devoted to the establishing Large Deviation Principle (LDP) for the 
distributions of the real and imaginary parts of log $^„ t j „(l) as a two dimensional 
random vector with values in the Skorokhod space endowed with the weak topology. 
We focus our study in the first regime on the case 5 — 0. Again in the first 
regime the case t = 1 is playing a special role and is not included. From the 
contraction principle we are then able to deduce LDP for the marginals at fixed time. 
Our approach is standard: we first compute the normalized cumulant generating 
function, compute its limit as well as its dual transform, and end with exponential 
tightness. 

In Section 4 we discuss the connections between the results of Section 3 and 
existing results on the LDP for the empirical spectral distribution for the circular 
Jacobi ensemble. 

Section 5 gathers in appendix some properties of remarkable functions and den- 
sities used in the proofs, in particular the Gamma function T and the Digamma 
function For z G C\R_, we take the notation: 

£(z) :=logr(*) ; *(*) = ^ . 

The determination of the logarithm in i is chosen in the unique way such that £ is 
holomorphic on C\R_ and real on R^. 

All along the paper, we use the entropy function J defined by : 

!u log u — u + 1 ifu>0 
1 if u = 

+oo if u < 

and its primitive 

ft f2 Q+2 

(1.7) F(t)= / J(u) du= -log*- — +t, (t>0). 

JO Z 4 

When the arguments of J or of F are complex, we choose the principal determina- 
tion of the logarithm. 

2. Convergence and fluctuations 
Let, for mcd > and < t < 1, 

T d {t) = log(l + 25Ked) - log(l + d) - log(l - £ + 2$Hed) + log(l - t + d) 
(2.1) 

Sd(t) = J(l + 2%>d)- J(l + 2y\td-t)-J(l + d)+J(l + d-t). 
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In this section, we are interested in the process 

'Mt) 



Cn(t) 



Vn(t) 



where we have written 

Iog$ Lnt j,„(l) - Elog$ LntJi „(l) = f„(t) + "&,(*) • 

As a consequence of the result just below, Elog $[„ t j !n (l) is finite, and then ( n (t) 
is well-defined. 

2.1. Convergence to a deterministic limit. 

Theorem 2.1. (1) In the first regime, i.e. for fixed S, and for n going to 



infinity, 



(2.2) 



Elog$„(l) = - \ gn + C + o(l) 



(2.3) 



where C is a constant, and for < t < 1, 

Elog$ L „ tJ ,„(l) = io g (i_ t ) + (i). 



(2) In the second regime, i.e. 5 = fS'dn, and for < t < 1, we have 



(2.4) lim Elog$ LntJin (l)-n£d 



uniformly in t. 
(3) In the first regime, we have 

(2.5) lim -J- Cov Cn(l) - \h, 

n->oc logn 



where I 2 denotes the 2x2 identity matrix, and for < t < 1, 
(2.6) lim CovCn(*)= / Z° s ds, 



where 



Z° ■= 

(3(l-t) 



h. 



(4) In the second regime, we have for < t < 1 : 



(2.7) 



lim Cov ( n (t) = I Zfds 



where 



1 



(2-8) Zf 



i 



l-t + 2$Hed 2(1 -t + d) 
1 



V 



am 



2(1 -t + d) 



Jm 



i 



2(1 - t + d) 
1 

2(1 -t + d) / 



(5) In the second regime, 



(2.9) 



sup 

te[o,i] 



^]og$ Lnt j, n (l)-£d(t) 



in i 2 , and t/ien in probability. 



ORTHOGONAL POLYNOMIALS AND CIRCULAR ENSEMBLES 



7 



Proof. Proof of (1) and (2). 

Taking expectations in (|5.19[) and summing up (|5.16j) . we have, for 1 < m < n, 
(2.10) 

n 

Elog$ m , n (l)= [y{P'(k-l) + l + S + 5)-y(p'(k-l) + l + 6)] . 

k=n—m-\-l 

If in (|2.10j) . we keep only the first two terms from the Abel-Plana formula (|5.20|) . 
we get, for m < n — 1, or in the second regime, for m — n large enough, 

[* (/3'(s -l) + l + 5 + $)-V {p{a - 1) + 1 + 6)] ds 

{p'{n - 1) + 1 + 8 + 6) - (/3'(n -m- I) + 1 + 6 + 6) 

(2.11) ~y (j3'(n - 1) + 1 + 6) + ^ (/3'{n -m-l) + l + 6) . 

(Note that in the first regime and for m = n, we would get the terms VP (— /?' + 1 + 6) 
if? (—[}' + 1 + 6 + 6), which are not always well-defined). Integrating = £', we get 
the following expression: 

(2.12) L := I(/3'(n-l)+6 + 6)-I(j3'(n-m-l) + 6 + 6) 

-I(l3'(n - 1) + 6) + 1(13' (n -m-l) + 6), 

where l{x) :— ■^ r i{x + 1) + + 1). From the Binet formula we have, for x > — 1, 

(2.13) £{x) = ±jx\ogx + ( 1 + \ ) logx - + n(x) 



p 2J ° p 



where 



(2.U)n{x) = jpf~ f(s)le~ sx ~ e-°] ds+\j™ e~ s * Q - sf(s) ds 
Now, set 

(2.15) h(n,m;6) = x(n—l + ~~^r \ —z(n — m—l + ~~~Jr~^ 



and 



.2[n-l + -j+X^»-m-l + - 



7 2 (». ( ^ + ^ ) -h(i>. iii:6) 



with 



(2.16) J 2 {n,m;S) = log - 1 + —jjrj - lo S - m - 1 + -^7- 

- log fn - 1 + + log - m - 1 + 

We have: 

L = Ii(n,m;6) + J2 (n, m, <5) + R, 
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where 

R = r 1 (/3>( n -i) + S + 5)-r 1 (j3'(n-m-l)+S + 5) 
-n(/3'(n -1) + S) + n(/3'(n -m-l) + 8). 

In the sequel, we use several times the trivial estimates (for c fixed and x tending 
to infinity) 

(2.17) \og{x + c) = loga; + o(l) ; l{x + c) = l{x) + clog a; + o(l) . 

Let us set m :— [nt\, and t n = m/n, and let us suppose that we are in the first 
regime. If < t < 1, we have 

h(n,m,8) = — log (n - 1 + —J - —log ( n- nt„ - 1 + — ) + o(l) 



and 

which implies 



.Alog(l-t)+ (l), 



J 2 (n,m,5) = o(l), 
L = -Alog(l-i)+i? + o(l). 



In the first regime and for t = 1, we have to estimate 

Elog$„(l) = Elog$„_i,„(l) + + J + + 

Since the constant C can be modified, it is equivalent to deal with m = n — 1 or 
with m — n. Taking m = n — 1 gives for some constants C\ and C*2, 

- ii„ 6 („-i + |) + r(^)-r(|) +0( i) 



-logn + Ci +o(l), 



1, <5) = log (A) - log (*±*) + (1) = C 2 + o(l), 



which implies, for some constant C3, 

L= — logn + R + C 3 + o{l). 

Let us now assume that we are in the second regime. For n large enough, we check 
the following estimates, uniform in t € (0, 1]: 

h(n,nt n ,n0'd) = n£ d (t n ) - J 2 (n,nt n ,nf3'd) + o(l), 

and 

J 2 (n,nt n ,n(3'd) = Td{t n ) + o(l), 

which implies 

L = n£ d (t n ) + Q - J- d (t„) + i? + o(l). 

Hence, (1) and (2) are proven, if we check that in any of the previous situations, 
Elog 3> m ,n(l)— L and i? tend to a constant when n goes to infinity, that this constant 
is zero, except perhaps in the first regime for t = 1, and that the convergence is 
uniform in t in the second regime. The first quantity can be expressed in function 
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of the last integral term of the Abel-Plana formula, and the second one comes from 
the remaining integral term of the Binet formula. 

First remaining term: Abel-Plana 

We have to prove the convergence of: 

g(n — m + iy) — g(n + iy) — g(n — m — iy) + g(n — iy) 

e 2*y 1 i dy ' 

when n goes to infinity, for 

g(z) := V(fl'(z -l) + l + S + S)- V{j?{z - 1) + 1 + 6), 

the limit being zero, except in the first regime for t = 1. Moreover, in the second 
regime, we want to check that the convergence is uniform with respect to t, i.e. 
with respect to m € {1, . . . ,n}. Note that in the second regime, the function g 
depends on n via 6 = nd/3'. It is then sufficient to check the following result: for 
any sequence (fc„)„>i of integers such that 1 < k n < n, 

g(k n + iy) - g(k n - iy) , 
— ay 



e 2«y _ I 

is well-defined for all n > 1, and tends to zero when n goes to infinity, in any case 
and uniformly with respect to the sequence (k n ) n >i in the second regime, and when 
(k n )n>i tends to infinity in the first regime. Indeed, we get the desired result by 
taking successively k n — n and k n = n — m. Note that in the first regime for t = I, 
we need to take k n = 1 (m = n — 1), which gives an integral independent of n, 
possibly different from zero. Now, we have 

g(k n ± iy) = ± P'iy) - *(A + LB ± P'iy), 

A , A' > 0, B G K being given by 

A a +iB Q = [3'(k n -l) + l + 6 

and 

A' = fi'{kn — l) + l + S + 6. 

In any case, Aq and A' tend to infinity with n, since in the first regime, k n goes to 
infinity, and in the second regime, it is the case for the real part of <5. Moreover, 
in the second regime, A and A 1 are greater than nff 5He d, independently from the 
sequence (k n ) n >i. Hence, it is sufficient to check that 

V(A + iB + iC) - V(A + iB - iC) 



J e^ c /P' - 1 



■dC 



is well-defined for A > and B £ R, and tends to zero, uniformy in B, for A — » oo. 
From (|5.10|) we have the following: 



-i mA + iB + iC) - HA + IB - iC)) = 2Cg {A + k+ \ B)2 + c2 • 

To estimate the sum of this series, we first apply the crude estimates \z\ > \3mz\ 
and \z\> \D\z z\ to the denominator: 

\(A + k + iB) 2 + C 2 \ > 2\B\\A + k\ 

\(A + k + iB) 2 + C 2 \ > \(A + k) 2 - B 2 + C 2 \. 
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The first inequality implies, for A + k < 2\B\, 

\(A + k + iB) 2 + C 2 \ > (A + kf, 
and the second implies, for A + k > 2\B\, 



\(A + k + iBf + C 2 \ > - A (A + kf + C 2 . 



Hence, 



^ (A + k + LB) 2 + C 2 



< 



1 

A 2 



Therefore, 

\%{A + iB + iC) - *(A + LB - iC)\ 

e 2irC/l3> _ 1 



1 



1 



dC <- — + - 



3 V A 2 A 



2C 



dC, 



where the last integral is finite and does not depend on A and B. 
Second remaining term: Binct 

The terms involved in the expression of R are of the form: 



(2.18) 

and 

(2.19) 



Ri(x,y) 



'/ool [ 



ds 



ds. 



where x = ft' (n — m — 1) + a and y — f3'(n — 1) + a with successively a = S + 5 
and a = 5. If t < 1 or if we are in the second regime, the real parts of x and y 
tend to infinity with n: moreover, the convergence is uniform in t in the second 
regime. Then, as in [35], the dominated convergence theorem allows to conclude 
that Ri(x,y) and R2(x,y) tend to zero, uniformly in t in the second regime. If 
t = 1 and if we are in the first regime, then y tends to infinity and x = a does not 
depend on n, which implies that R\(x,y) and R2(x,y) are still converging when n 
goes to infinity. 

Proof of (3) and (4): Computation of covariances. 

By independence of the variables 7j, we can sum up variances or covariances 
issued from (|5. 18|) . We can then prove the announced result in the same way as 
(1) and (2), using the Abel-Plana summation and the Binet formula again, with *S> 
replaced by we omit the detail. 

Proof of (5): the process (Cn(*))o<t<i is a two-dimensional martingale. Hence, 
by Doob's inequality: 



E 



sup |C„(i)| 2 
te[o,i] 

1 



<4E[|C„(1)| 2 ] 
dt 

/3' 7o 1 -t + 2med 
Moreover, by the uniform convergence (2) 

\nt\ 



4Tr(CovCn(l)) 



— log 



l + 2<Hcd 
2%>d 



< oo. 



sup 

te[o,i] 



2 PJ te[o,i] 
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and one also has 



te[o,i] V n J te[o,i] 



sup n £ d [i^i ) -£ d (t) < sup \£' d (t)\ < oo. 



Combining these estimates gives the following L 2 bound: 



supE sup |log$LniJ.n(!) - n£ d (t)\ ) < oo, 
n>i \te[o,i] / 




which gives (4) after dividing by n 2 . 

2.2. Fluctuations. We now state a theorem about some limiting distributions 
related to the process {&y nt j^ n (l),t £ [0,1]}. Since it can be shown by adapting 
the arguments of Killip and Stoiciu [17] for similar results, we omit the proof of our 
theorem, except for the last part. 

Let Dt and D be the space of cadlag R 2 -valued functions on [0,T] and [0, 1) 
respectively, starting from zero. The next theorem is about some limiting distribu- 
tions related to the process {( n (t),t e [0,1]} or {( n (t),t < 1}. More precisely let 
(Z^) 1 / 2 denote the positive symmetric square root of Zf defined in (|2.8|) and let 
B t be a standard two dimensional Brownian motion. 

Theorem 2.2. (1) If 5 = f3'dn with 9^ed > (second regime), then as n — > oo 
the process {Cn(t) ; t £ [0,1]}„ converges in distribution in the Skorokhod 
space Di to the Gaussian diffusion {£ t d ; t £ [0, 1]}, solution of the stochas- 
tic differential equation: 



(2) If S is fixed with ?le S > — 1/2 (first regime ), then the joint law of the process 
{Cn(t) ; t < 1} (with trajectories in the Skorokhod space D) and the variable 



converges, when n goes to infinity, to the joint distribution of {£ t ° ; t < 1} 
and jV(0; + i -*V(0; /3 ), the process and the two gaussian variables 
being independent. 

Notice that the convergence in law of & is an extension of the celebrated Keating 
and Snaith result [T4"] . 

Proof. In order to prove (1), and in (2), the convergence of {( n (t) ; t < 1} and 
0, taken separately, we apply a version of the Lindeberg-Levy-Lyapunov criterion, 
available for convergence of processes ([13] Chap. 3c). For t < 1, or in the second 
regime, it is enough to prove that 



(2.20) 



dtt = {Zty' 2 aTl t . 



9 := 



log <£>„(!) 




LntJ-l 



(2.21) 




where 



Afc=tog(l-7 fc )-Epog(l-7fc)]. 
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For t = 1 in the first regime, it is sufficient to check: 

1 n— 1 

(2.22) £ E |A fc | 4 -> 0. 
Now, 

E|A fc | 4 < 8E(«eA fc ) 4 + 8EpmA fc ) 4 

< 24 Var 2 (%> log(l - 7 fe )) + 24 Var 2 (<Re log(l - 7 fe )) 

+SKi{mc log(l - 7fe)) + 8K 4 (3m log(l - 7fe )), 

where K4 denotes the fourth cumulant. Using the second and fourth order deriva- 
tives of the function A introduced in the appendix, we see that E|Afe| 4 is a linear 
combination of terms of the form ty"'(r + 1 + a) and ^'(r + 1 + a)^>'(r + 1 + a'), 
for r = /3'(n - k - 1) and a, a' e {6, 6,6 + 6}. Now, by for 9\ex > 0, 

and 

Hence, all the terms involved in the expression of E| | 4 are dominated by (n— k)~ 2 , 
and they are dominated by rT 2 in the second regime or for t < 1. Hence, 

\nt\-l 
k=0 

is dominated by except in the first regime for t — 1, in which case it is bounded. 
This shows the desired results (|2~2"Tj) and ([2~22]l . 

In order to prove the convergence of the joint distribution in (2), we can follow 
the scheme of [55] p. 209. We take < to < t\ < 1. From the above results, 

(2.23) l°sWn(l) ^ Q 

Vlogn 



in probability, so that 

log$„(l)-log$ LntlJ , n (l) - 4 log 7 



the two gaussian variables being independent. Now, for n large enough, 

n-l 

iog$ n (i)-io g $ LntlJin (i)= J2 C 1 -™). 

/c— L^iii J 

is independent of {Cn(^) ; ^ < ^o}, which is function of {"ik)k<nt a (recall that the 
variables jk are independent). Since, {( n (t) ; t < t } tends in law to ; t < t } 
(as a process with trajectories in D to ), we deduce that 



log $„(!)- log $ LntlJ)n (l)- ^logn 



Vlogn 
tends in law to 

({C t ° ; t<to};jV(0;j8- 1 )+iAr(0; ) 8- :l )) 
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{Ct° ; t<t } being independent of JV(0; +W(0; /3" 1 ). Using again ([2~23]) . we 
deduce that 



{CnW ; * < to} 



log$„(l) - |rlogn S 



\/logn / 

has the same limiting distribution. Taking t — > 1 gives part (2) of the theorem. 

3. Large deviations 

3.1. Notation and main statements. Throughout this section, we use the stan- 
dard notation of [TJ. In particular we write LDP for Large Deviation Principle. We 
say that a sequence (P n )neN of probability measures on a Polish space X satisfies a 
LDP with speed a n (going to infinity with n) and rate function / iff I : X — > [0, oo] 
is lower semicontinuous and if 

For every open set O C X lim inf — log P n (0) > — inf / , 

' n a n O 

For every closed set F C X lim inf — log P n (F) > — inf I . 

n a n F 

The rate function is good if its level sets are compact. Moreover, if X n are X random 
variables distributed according to P n , we say that the sequence (X n ) satisfies the 
LDP if the sequence (P n ) satisfies the LDP. 

The reader may have some interest in consulting [8] and mainly |22j where a 
similar method is used for a different model. 

For T < 1, let Mt stand for the space of signed measures on [0,T] and let M< 
be the subspace of Mi consisting of measures whose support is a compact subset of 
[0, 1). We endow D x D with the weak topology a(D x D, M< x M<). So, D x D 
is the projective limit of the family, indexed by T < 1, of the topological spaces 
(D T x D T ,a(D T x D T ,M T x M T )). 

Let Vi (resp. V r ) be the space of left (resp. right) continuous R-valued functions 
with bounded variations. We put a superscript T to specify the functions on [0, T]. 
There is a bijective correspondence between and Mt: 

- for any v G Vf , there exists a unique /z £ Mt such that v — fi([0, •]); we denote 
it by v , 

- for any /j, G Mt, v — ju([0, ■]) is in Vj . 

For v £ D, let v — v a + v s be the Lebesgue decomposition of the measure v in 
absolutely continuous and singular parts with respect to the (vectorial) Lebesgue 
measure. The measure v a can then be identified with its density. 

Now, for £, r\ £ K, let us define: 

(3.1) H a (£,r)) :=-£-]og(2cos»7-e*), 

if \r]\ < ir/2 and 2cos7? — > 0, and H a (£,r]) — oo otherwise. For ip, ip £ V r and 
T < 1, let us denote: 

r T (l - T)H a (Mr),i>a(T)) dr + J T (1 - r)d(-0.)(r) 

(3.2) Ho(T, ip, ip) :— ^ if ijj s = and — ip s is a positive measure , 

-oo otherwise. 

Theorem 3.1. (1) For every T < 1, the sequence 

{n~ l (?it log $ LntJ ,„(!), 3m log $ LntJ ,„(!)) ; *g [0,T]} 
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under the CflE^ measure (i.e. first regime and 6 = 0) satisfies in (Dt X 
Dt, <t(Dt x Dt, Mt X Mt)) the LDP with speed f3'n 2 with good rate function 
Ho(T,ip,ip). 
(2) For y\z d > and T < 1, the sequence 

{n- 1 (!EHclog$ Lrit j !n (l),Jmlog$ L „ t j,„(l)) t e [0,T]} 

under the Cjj^, dn measure (second regime) satisfies in (Dt X Dt,o(Dt X 
Dt,Mj x Mt)) tte LDP wzi/i speed (3'n 2 with good rate function 

(3.3) H d (T, cp, = H (T, cp, V) - 2(JRe d)<p{T) - 2(3md)^(T) - C d (T) 

with 

C d (T) = F(l + d) - F(l - T + d) + F(l + d) — F(l — T + d) 
+ 2<Ke d) + F(l - T + 2%> d) - F(l) + F(l - T) . 

We give now a result on the marginal at time T fixed. It is obtained by applying 
the contraction principle to the mapping 

(^^)^MT)i(T)). 

In all the sequel of the paper, we consider either the Cf3E^ n > ensemble (first regime 
for 5 = 0), or the Cji™L dn ensemble for Died > (second regime). In the first case, 
we put d = 0. 

Theorem 3.2. When (fHe d > 0, T < I) or (d = 0, T < I) , the sequence 

{n- 1 (mt log $ L „ TJ (I), 3m log $ L „ T j (!))}„ 
satisfies the LDP in R 2 with speed /3'n 2 with good rate function 

(3.4) h d (T, C, r,) = inf{H d (T, ip, | <p(T) = £, V(T) = 77} • 
In particular ( cf. (3. 3\) ) 

(3.5) fe d (T,^»j) = fto(T,€,t J )-2(J«ed)€-2(amd)» J -Cd(r), 
which allows to compute easily h d when ho is known. 

For the two coordinates, separately, we have the following known result, which 
comes from formula (C.5) in [T2"] . 

Theorem 3.3. Assume d = and (3 = 2. 

(1) The sequence {n^ 1 Cfte log $ n (l)}„ satisfies the LDP in M with speed n 2 and 
rate function given by the dual (Legendre) of the function: 

s 1 ^ log(l + «) - (l + -J log (l + -J - - log (2s) 

for s > 0, and by 00 for s < 0. 7i vanishes for negative values of the 
argument, and it is infinite beyond log 2. 

(2) The sequence {n^ 1 3m log & n (l)}n satisfies the LDP in M with speed n 2 and 
good rate function given by the dual (Legendre) of the function given by 

t i-> log (l + ^\ - 1 log (l + L^j +t arctan(i/2) . 

is finite on (— 7r/2, tt/2), and infinite otherwise. 
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We now give the precise behavior of the first coordinate, in the particular case 
where d is real. 

Theorem 3.4. (1) For (d = 0,T < 1), or (d > 0,T < 1), the sequence 
{n" 1 Cfte log$„(T)}„ satisfies the LDP in M with speed /3'n 2 with good 
rate function hd(T, •, 0). 

(2) Let := J(T) -I- J (±*f) + J < 0. 

(a) //£ <E [£t,T log 2) i/ie equation 

(3.6) J"(i +7 )_ J-(i-T + 7 )- + + j-(i-r+2) =e 

/ias a unique solution 7 and we have 

(3.7) fto(T,£,0)=7£-4,(T, 7 ,0) 
w/iere 

£o(T )7 ,0) := F(l+7)-F(l-T + 7 )-F(l-T) + F(l) 

(3.8) -2F(l + |) + 2F(l-T+^) 

(b) //£<&, «»en 

(3.9) fto(T, & 0) = /i (T, fr, 0) + (1 - T)(£ T - £) . 

(c) //£> Flog 2, i/ien h o {T,£_,0) = 00. 

(3) For (d > 0,T < 1) or (d = 0,T < 1), the function £ i-» /i d (T,£,0) is i/ie 
dwcd function of 

7 -> £ d (T, 7, 0) := £ (T, 7 + 2d, 0) - C (T, 2d, 0) . 

3.2. Proof of Theorem 13.11 We compute the normalized cumulant generating 
function, find its limit, perform the dual transform, study the exponential tightness 
and eventually prove that the LDP is satisfied. 
From (|5.12j) and (|5.13|) we see that 

,3 1Q) CJp, S [{l-liYd-liY] = Cr^ 

for every j < n— 1 and z such that 2£He (S + z)) > —1, where r = f3'(n—j — l). The 
RHS of (|3.10[) does not depend on z. It should then be clear that we can reduce the 
case *Hed > to the case d = 0. The above shift in the argument of the generating 
function provides the linear term — 2(*Kcd)ip(T) — 2(3md)ifj(T) in the rate function, 
and the RHS of (|3~TU)) gives the constant -C d (T). 

• First step: The normalized cumulant generating function. 

Lemma 3.5. Let T < 1. 

(1) Let us assume d = 0. 

(a) For every path (x(r), y(r)) rg [ 0i T] <= x such that x(t) + 1 — t > 
on [0, T], sei 2z(t) := x(t) + iy(r) and 

(3.11) A (T,x,y) := 

/ (.7(1 - r + x(r)) - J(l - r + z(r)) - JTl - r + z(r)) + J(l - r)) dr . 
Jo 
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Then we have: 



(3.12) 

lim — log E exp [ n(3'D\z 
n->oo p'n 1 \ 



(x{t) -iy(r))dlog$ L „ Tj (l) 



(b) In particular, when x(.) = s and y(.) = t, if we set Co(T,s,t) :— 
Ao(T,x,y) (which generalizes the notation Cq(T,^,0) introduced in 
Theorem\3.1$, we have for every s > — (1 — T) 



(3.13) C (T,s,t) = F(l + s)- F(l-T + s) + F(l)- F(l-T) 

-F(l + z) +F{1 — T+z) — F(l + z) + F(l - T + z) 

where 2z = s + it. 
(2) For D\z d > 0, the analogues of h3.11\l and 13.13}) are 

(3.14) k d (T,x,y) = A (T,a: + 29led,y + 23md)-Ao(T,29led,2amd), 
and 

(3.15) £ d (r, s, t) = £q(T, s + 29^e d,t + 23m d) - C (T, 2Ke d, 23m d) , 

fors> -(l-T)-2*Hed. 

Proof. (l)(a) Let us set x(t) = y(r) = z(t) := for t > T, and tj := (j + l)/n for 
j = 0, . . . , n — 1. One has: 



E exp I n/3'mc 



= Eexp n/3'V\t 



{x{t) -it/(r))dlog$ LnTj (l) 

?i-i 



= [] Eexp (n/3'<Ke [(arfo) - iyfa)) log(l - 7i )]) 

3=0 

and using ()5.14|) . we get: 

fT 



log Eexp n/3'me 



{x(t) -iy(r))dlog$ LnTj (l) 



= X [£(71^(1 - Tj + xfa)) + 1) + W(l - Ti ) + 1) 

(3.16) -£{np'(l - Tj + z[Tj)) + 1) - £(n(3'(l - Tj + z{t 3 )) + 1)] . 

Now, the Binet formula (I5.1|) yields: 

/>00 

£(u+l) = log(tt)+logr(u) = (it + l/2)logu-u+l+ / f(s)[e- su -e- s ]ds. 

Jo 

If we apply four times this formula in order to estimate the term indexed by j < n 
in (|3.16[) . the contribution of the term u is 0, the contribution of the term logw, is 



!0g(l " Tj) + l0g(l - Tj + X {Tj)) - log(l - Tj + z(Tj)) - l0g(l - Tj + z(Tj)) , 
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the contribution of the term w log it is proportional to j3'n with coefficient 
J(l - Tj) - J(l - Tj + z(Tj)) - J(l - Tj + ifo)) + J (I - Tj + x(rj)) ; 

dividing by /3'n 2 and performing Riemann sums gives the integral in ()3.1ip . The 
remaining part is a sum of bounded terms which is negligible with respect to n 2 . 

(1) (b) The equality fl3.13[) is obvious by integration. 

(2) To get the expression corresponding to ffted > we just use (|3 . 10|) . ■ 



• Second step ; the Legendre duality. 

It will be convenient to perform a time-change in (|3.11li . setting 

(3.17) x(t) = (1 - t)X(t) ; y(r) = (1 - t)Y{t) ; z(t) = (1 - t)Z{t) 

The above expression (13. lip ) of Ao becomes 



(3.18) 

where 
(3.19) 
i.e. 



(3.20) 



K (T,x,y) 



(l-r)L(X(T),Y(r))dr 



L(X, Y) = J{1 + X)- J{1 + Z)- J(l + Z) 



L(X,Y) = (1 + X)log(l +X) + Farctan 



Y 



2 + X 



X , , 

i + y ) log 



X 



Y 2 



+ — 



Looking for the Legendre dual, we see that for < 7r/2 and < 2 cos Tj, the 
supremum 

L*(£, r,) = sup Xe, + Yr] - L(X, Y) 

X,Y 



is achieved in (X, Y) satisfying 
(3.21) 



Y 



1 + 4) +^ 



y2 



' 2 + X 



tan 77 



i.e. 



X = 



— cos 7] 
cos 77 — |e£ 



Y = 



sin 77 



cos 77 — Tje^ 



Note that under the assumption above, X is admissible, i.e. X > — 1. One deduces 
that (cf. (pTTjl ) 



(3.22) 



L*(Z,v) = -C- log(2co S77 - e«) = H a (£,rj). 



On the other hand, one can check that rj) is infinite if |?7| > 7r/2 or e* > 2 cos 77. 
One deduces that there exists a recession function: 



K— V + oo 



-£ if f < and 77 = , 
00 otherwise . 
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This function can be used to obtain the rate function l-Lo(T, if, if)) when the measures 
ip s and ips are not zero. By using the same methods as in [22] , one deduces 

cT 

(x(T)ekp(r) + 2/(r)#(r)J - A (T, x(.), y(.)) 



n (T, p, ip) 



sup 



= sup 

X(.),r(.)./n 
cT 



(1 - r) X(t)(1(p(t) + Y(T)dtjj(T) - L{X(t),Y(t)) dr 



(l-T)H a ( l pa(T),Mr))dT+ / (l-r)d(-0 5 )(r), 



where the second equality comes from (|3.17[) and (|3.18|) , and where the last equality 
comes from [21] Theorem 5. 

The value of the constant Cd{T) is obtained owing to (|3.13l) : 

C d (T) = -£o(T,2«Hed,23md) . 



• Third step : exponential tightness. 

Exponential tightness is not needed for the second argument since it lives in 
[— 7r/2, 7r/2]. For the first, we have | logcc| < — log a; + 21og2 for x < 2, hence: 

P d ( I log 1 1 — 1 1 > no) <P d ( -log(l >n(a- 2Tlog2)). 

j<nT-l j<nT-l 

Now, for 9 < (by Chernov inequality), 

P d ( ]T - ^ - 2T1 °g2)) < e n2 ^ a - 2T ^E d (|$L«Tj(l)| n ^' e ) 

j<nX-l 

so that, taking logarithm and applying (|3.13j) we get, for 9 € ( — (1 — T) — 2$Hed, 0) 
Kmsup( / 8'n 2 )- 1 logP d ( V |log|l-yj| > m) < 6>(a - 2T log 2) + £ d (T, 6», 0). 

n ^°° j<nT 

It remains to let a — > oo to get the exponential tightness. We remark that when 
d = 0, the exponential tightness holds only for T < 1. ■ 

Remark 3.6. The mean trajectory is obtained when H a (ip,tp) = i.e. cos - !/) = 
cosh ip or ip = %jj = 

3.3. Comment on Theorem 13.21 Let us study the variational problem (I3.4[) 
issued from the contraction. Using (I3.2[) and (|3.22[) . we see that the Euler equation 
is 

and the optimal path is then given by 

■( \ dL ( 1 P \ U \ dL ( 7 
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i.e. 



(3.24) <&(r) = log(l-r + 7)-^log(l-r+|) 

(3.25) ip(r) = arctan ; 



7\ 2 , P 2 



2(1-t)+ 7 

This path will be admissible if there exist 7 and p such that 

(3.26) f ^(r)dr = £, f ^(r)dr = r?. 

Jo Jo 

When the path is admissible, we have 
H a {ip{ r ),i>{r)) = £(<p(t),tP(t)) = -^—<p(t) + j^-fa) - L 

t — T 1 — T 

and 

rT 



ho(T, £, tj) = 7 £ + W - Jf (1 - r)L (jf^, j dr. 

3.4. Comment on Theorem 13.41 In the case d = 0, T = 1, [T^] proved the 
LDP by tackling directly the normalized cumulant generating function. This gives 
an incomplete LDP since there is no steepness in 0. They use a Fourier inversion 
to take into account the negative side. We see that the function £ (l,s,0) is the 
limiting n.c.g.f. of log |$„(1)|, computed in [12] Theorem 3.3. Besides, £q(1, 0, t) is 
the limiting n.c.g.f. of the argument of $ n (l), computed in [TJ], Theorem 3.4, and 

lim WM =± 1. 

i->±oo t 2 

3.5. Proof of Theorem 13.41 In the 3m d ^ case, or the case where d = and 
T < 1, we could also use the scheme described in Section l3~4l We prefer illustrate 
the method of contraction, where we will see that the counterpart of the singular 
contribution is an afhne part. To simplify the exposition, we assume the problem 
one-dimensional (consider only the first component) and d > 0. 

Since 

^(2>,0) = ^(T,.s + 2d,0) 
os os 

= J(l+s + 2d) -J(l-T + s + 2d) 
-J(l + ^+d) + J(l-T+^+d) 

and since J [a + s) — J{b + s) = (a — b) log s + o(l) as s — ¥ 00, we have 

Br 

lim — l( s ,0) =Tlog2 

s->oo OS 

which corresponds to the endpoint of the interval allowed for 9iz log $„ (T) . On the 
other side, if d > 0, 

Jim (T, S , 0) = J{T) -l-j( ilZT) + J ( 1^) =: £ T . 

s4--(l-T)-2d <9s V 2 / V 2 / 

There is a problem of non-steepness since it is not —00. 

The first coordinate satisfies the LDP with good rate function 

t*M{ho(T,Z,r))\ri} 
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and from (|3~4|) 

inf {h (T, C V )\ V } = M{H (T, <p, $)\<p{T) = £} 

From the structure of Jio and H a , it is clear that this infimum is achieved for 
ip(.) = i.e. p = 0. 

In the case of admissible tp, it remains to compute 7. Let us study the mapping 
7 i-> y(T, 7) where </?(•, 7) is given by f|3 . 24[) with p = 0. Wc follow the lines of 
argument of [22] pp. 3216-3218. We have 



cV(T, 7 ) I+7 1 1 



■ 



(3.27) m ^ = i og - ' -^log \'* >0. 

07 1 — i+7 2 1 — J + 

The mapping 7 H> ^(T, 7) is then bijective from [—(1 — T),oo) to [£r,T'log2). 

Fixing £ € (— 00, T log 2], let us look for optimal ip. Let 7 > —(1 — T) (playing 
the role of a Lagrange multiplier). By the duality property, we have the inequality 

(1 - t)L* (p a (r)) > 7 ^(r) - (1 - r)L 

Using (|3.2p and (|3.22l) we get, by integration of the above inequality: 

n o (T,p,0) = [ (l-r)L*(p a (T))dT+ [ (i_ r )d(-£,)(r) 
Jo Jo 

> 7<Pa(T)-J (1 - r)L (^-,0\ dr 

+ [ T (l-r)d(~ip s )(r). 
Jo 

For every tp such that <p(T) = £, we then have 

H Q (T,tp,0) > 7 C-^ (l-T)L(^-,o)dr- (l-r + 7 )# s (r) 

(3.28) > / (l-rjLf-^.O^dr. 







1 - T' 



We can now distinguish three cases: 

• If £ G [£T,7Tog2), we choose the path absolutely continuous and such that 

v£(t) = log(l - r + 7«) - log fl - r + X-Y 

where 7^ is uniquely determined by the condition u*(T) = £. This path saturates 
the infimum and the expression of the action integral is clear. 

• If £ < £r, set e = £ T - f. Plugging 7 = -(1 - T) in (PHSl) yields for v such 
that u(T) = £: 

Ho(T, v, 0) > -(1 - T)£ - jf (1 - r)L ( "^J^ .o) = (1 - T)e + &o(T, $r, 0) 
and this lower bound is achieved for the measure v — v^ T {r)dr — sSt, since 
H a (v^) = h (T,^ T ,Q) , / (l-r)ed*r(T) = (l-T)e. 
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If £ = Tlog2, make £ = Tlog2 in IpOS]) . We get, for all 7 > -(1 - T), 



(3.29) 



/i (T,Tlog2,0)>7Tlog2-^ (1 - r)L (7^7* °) dT 



When 7 -> 00, the integral is F(l + 7) - F(l - T + 7) + F(l) - F(l - T) - 2F(1 + 
7/2) + 2F(1 - T + 7/2) which tends to -co as -Tlog7, so that finally, the RHS 
(|3.29[) tends to 00 and we conclude ho(T, T log 2,0) = 00. 

4. Connection with the spectral method 

It can be interesting to connect the results of the previous section to the results 
obtained by looking directly at the empirical spectral distribution of the ensembles 
which are considered. This point of view is also discussed in [12] . 

The LDP for the empirical spectral distribution of the unitary ensemble is given 
in [llj . The rate function is the Voiculescu's logarithmic entropy: 

(4.1) Ifr) = -St(m) ■=-[[ log \z - z'\ dfi(z)dfi(z'). 

J JTxT 

The circular Jacobi unitary ensemble yields also a LDP given in [5]. The rate 
function is 

(4.2) I d ( M ) = -£t(m) + / QAz)d^(z) + B(d) , 
where 

Q 

(4.3) Q d (e ie ) := -2(%> d) log (2 sin -) - pmd)(0 - tt) (0 G (0, 2tt)) . 
and 

B(d) = [ [(x + 2med)log(x + 2*Rtd)-2me[(x + d)log(x + d)]]dx 
Jo 

(4.4) + / a; log da;. 

Jo 

(Notice that there was a mistake in [5], fixed in the arXiv version). With our 
notation, it yields: 

B(d) = F(l + 2<Ke d) - F(2fRe d) - 2<Ke F(l + d) + 2<Kc F(d) + F(l) . 

If the mapping fj, € .M i(T) 1— > J log(l — z) d[i(z) were continuous, we would have 
by contraction: 

(4.5) h d (l, e, rj) = inf{/ d (/i) | [i e M^T) : / log(l - 2) d/x(z) = £ + ir/} . 



We conjecture that this formula holds nevertheless, and we prove it in the one- 
dimensional case. 

Proposition 4.1. With the notation of Section 3, we have, for d > and for every 

£e[0,lo g 2) 

(4.6) /i d (U,0) = inf{/ d (AO I I \og\\-z\d^z)=Q. 

Jt 

We use the following interesting result. 
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Proposition 4.2. We have 

(4.7) inf{-S T (/i) | fi € Mi(T) : [ log |1 - z\ d/i(z) = £} = -E T ( Ma ) 
where 

• f/ie measure fi a € A^i(T) is defined by 



J sin 2 (#/2) -sin 2 (0 Q /2) 
(4.8) d Ma (z) = (1 + a)-* ^ s . n l(fl.,!hr-fl.)(0) <W 

where z — e lS , 8 G [0, 27r] and (9 a G (0, 7r) is sitc/i </ia< sin(9 a /2 = yq^- 
• a is i/ie unique solution of 



/ log|l-z|d// a (z) = £. 



Moreoi 

arg(l - z) d/i (z) = . 

The two propositions mean that to get the non standard mean value £ for the 
logarithm of the determinant, the most probable way is to force the random operator 
to get an empirical spectral distribution close to /j. a - 

Proof of Proposition \4.1\ given Proposi tion \4.2l Let us first suppose that the Propo- 
sition [4J] is true for d = 0. One deduces, from this assumption and the definition 
dH of J d : 



inf{/ d (/x) | n G Mi(T) : J log |1 - z\dfi{z) = £} 

= inf{-S T (/i) | ft G Mx(T) : / log |1 - z\d f i(z) = £} + B(d) - 2d£ 
= ho(l, & 0) + B(d) - 2d£ = ft d (l, £, 0), 

the last equality coming from (I3.5[) and the fact that -B(d) = — C d (l). Hence, it is 
sufficient to prove Proposition 14. 1 1 for d = 0. 

The RHS of (|4.6p is — 'Ey(p, a ) and it remains to prove that it fits with the value 

h (l, £, 0) = 7 £ - ^(1 + 7) + F( 7 ) - F(l) + 2F (l + 1) - 2F (1) 
where in view of Theorem [33] (2), 7 is the solution of (|3.6|) i.e. 

(4.9) ,7(1+^-^)- + +^(|) =£. 

Applying [5] formula (5.25) in the special case d = a (real and positive), we see that, 
since the corresponding rate function vanishes for the limiting empirical measure 

Ha, 

(4.10) £ T (Ma)= / Q a (Qdix a (()+B(a), 

Jt 

where Q a {() = —2a log |1 — £|. Let us now compute: 

(4.11) /:= /log|l-C| d/x„(C). 

1 + a Jt 
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We have, by definition of fi a in (|4.8[) , 



f 2 — «■ */sin 2 (0/2)-sin 2 (0 a /2) 

/ = / log(2 S in(0/2))V r ^ d9 

Je a sm(0/2) 

r Jsm 2 {e/2)-sm 2 {e a /2) 

(4.12) = 2/ log(2sin(0/2)) V r ^ dfl s 

Je a sm(0/2) 

for 9 a £ [0,7r] and sin(0 a /2) = a/(l + a). The hrst change of variables u — sin(0/2) 
gives 



d6 = 



2du 



2du 



cos(0/2) 71^2-' 



and then 



7 = 4 

/o/(l+o) 

The second change of variable 



io g (2 U ; 



u 2 - [a/(l + a)] 2 
1 — v? u 



u 2 - [a/(l + a)] 2 
1-u 2 



t» 2 + [a/(l + a)p 
l + t> 2 



gives 



Iog(2«) - log 2 + ~ log (v 2 + [a/(l + a)] 2 ) - 1 log(« 2 + 1), 



it 



w 2 + [a/(l + a)] 2 « 2 + l 



Hence, 
1 = 4 
(4.13) 



log 2 + 1 log (u 2 + [a/(l + a)] 2 ) - i log(« 2 + 1) 



log 2 + I log (i, 2 + [a/(l + a)] 2 ) - ± log(i, 2 + 1) 



w 2 + [a/(l + a)] 2 u 2 + 1 
1 [a/(l + a)] 2 



v 2 + 1 w 2 + [a/(l + a)] s 



dv. 



Now, for a, j8 > 0, 
(4.14) 

(4.15) 



a 



■rdv = 



j v 2 + a 2 

00 a 2 log^+Z? 2 ) ^ 

5 5 <fo = 2a7rlog(a + p). 

-on « + or 



The first equality (I4.14[) is elementary. To get f|4. 1 5[) , one can differentiate the LHS 
with respect to /3, and one obtains, for a ^ f3, 



2a 2 /3 



dv = 



2a 2 /? 



„ oc (v 2 + a 2 )(v 2 + /3 2 ) /3 2 - a 



dv 



-oo v 2 + a 2 J_ 00 V 2 + (3 



dv 



2ira 
a + f3' 



By continuity, the last equality remains true for a = /3, and reversing the differen- 
tiation gives 

r °° a 2 \og{v 2 + /3 2 ) 



v 2 + a 2 



-dv = 2an log(a + /3) + C{a). 
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Now, to determinate C(a) let us write 

o? log^ + Z^ 2 [°° a [°° alogO+^V^) 
dv = a log(p J / -= ydv + a / 5 5 aw. 



The first integral is equal to 2a7rlog(/3) and the second tends to zero for a fixed 
and P — > 00 by dominated convergence. This gives C(a) = and then (|4.15p . 
Expanding (|4.13p . then using (I4.14|) and f|4.15[) gives: 

/ = [(1 + 2a) log(l + 2a) - (1 + a) log(l + a) - 2a log(2a) + a log(a)] . 

1 + a 

Coming back to the definition of J, we obtain: 

f log |1 - C| (WC) = 1(1 + 2a) - 1(1 + a) - l(2a)+l(a) = S a (l) . 
Jt 

In particular, 

/" log |1 - CI a> 7 / 2 (C) = £, 

JT 

and the value of a involved in Proposition 14.21 is equal to 7/2. It remains to check 
that — Et(/x 7 / 2 ) = ^o(1j£ ; 0). Indeed, 



-ErOva) = -/ Q 7 /2(CMM7/2(C)-S( 7 /2) = 7^-^(7/2) 

= 7f - ^(1 + 7) + ^(7) + 2F(1 + 7/2) - 2F( 7 /2) - F(l) , 
which proves Proposition ^. II 



Remark 4.3. The determination of I could also be viewed as a consequence of 
the weak convergence of the empirical spectral measure under CJ na pi . Of course 
the mapping /i i-> J T log|l — CI ^A*(C) * s n °t continuous, but since the extremal 
eigenvalues converge to the extreme points of the support of \i a (see [2],), we have 

[ Iog|l-C|<WC)= l™ -log|$„(l)| =S a (l) 

where the last equality comes from \2. 9\) in Theorem \2.1\ 

Proof of Proposition \4.2\ A possible way consists in studying the maximization of 
the functional 



E T (^)+r / log|l-z| dfx(z), 
Jt 



where r is some Lagrange multiplier. 

It is convenient to push-forward this problem to R via the Cayley transformation 

z = e^ = ^ 
A — 1 

so that, for z, z' 7^ 1, 

log 1 1 - 2r| = -ilog(l + A 2 )+log2 

log|z-z'| = log|A-A'|-^log(l + A 2 )-ilog(l + A' 2 )+log2. 
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Hence, one needs to minimize: 

/ ()(\),h,(\). 

over v 6 A^i(M), where 



and 



2Q(a:)= (l + -)log(l + x 2 ). 



This problem can be connected to the research of the equilibrium measure in the 
Cauchy ensemble, i.e. for the Coulomb gas with probability distribution given by 
formula (3.124) in Forrester [TU]. According to Saff and Totik [23], this potential Q 
is admissible (p. 27) as soon as r > 0. In that case the minimizer is unique (Theorem 
1.3), its support is compact. In the book, an explicit method is presented for a class 
of potentials satisfying some conditions, in particular if Q is even, diffcrcntiable and 
such that xQ'(x) is positive and increasing in (0, oo). These properties are satisfied 
here since: 

xQ'(x) = (1 



2> l + x 2 

Then ([24] Corollary 1.12 p. 203) the support is S = [—6, b] where b is solution of 

2 f 1 btQ'tbt) 
4.16 - / /* V ' dt = 1 

2 + r f 1 b 2 t 2 

—— / ; dt = 1 

TT Jo (l + b 2 t 2 )VT^ 

or 

(4.17) t - = nr 

y ' Jo (l + 6 2 t 2 )vT^F 2(2 + r) 

which gives (make t = (1 + t 2 )~ 1//2 ) 

dr Trr 



1 + b 2 + t 2 2(2 + r) 



i.e. 



r 

To find the extremal measure, we apply Theorem IV. 3.1 in [24], which contains the 
following result: 

Theorem 4.4 (Lubinsky-Saff ) . Xe< f be a differentiable even function on [— 1, +1], 
smc/i £/ia£ sf'(s) is increasing for s € (0, 1) and for some 1 < p < 2, f'{s)/\l ~ s 2 € 
i p [— 1, 1]. Then the integral equation 

nl 1 

log, -g(t)dt = -f(x) + C f , xe(-l,l), 

l \x-t\ 

(where Cf is some constant) has a solution of the form 
(4-18) g(t)=L[f'](t)+ Bf 

7TV1 — t Z 
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where 

■' (»/'(») -«/(()) 



(4 . 19) L[m . i/V-WW-TO)^ 

„, = i-I/'-gfcL*. 



Here, we look for the measure /i with support [—6,6] such that for z G (—6, 6), 

(4.20) f \og— !—dfi(t) = -Q(z) + C. 

J-b \ z ^ l \ 

Theorem 14.41 is set for 6 = 1, so we need a scaling. Equation (I4.20p becomes, with 
d\i(x) = g b {x)dx, 

f 1 1 

(4.21) / log, -bg b {bt)dt = -Q(bz) + C. 

J-i V- — 1\ 

Theorem l4.4l can now be applied by taking f(s) = Q(bs), and one has bgb{bt) = g(t), 



B f = and 



sf(s)-tf(t) _ b 2( 1+ r_\ 



s 2 -t 2 V 2/ (l + 6 2 i 2 )(l + 6 2 s 2 ) 

Owing to (|4.17p . we get 



9bto = JZ I[-M](*)- 



i.e. 

6?r (1 + a; 2 ) 

Now, it is enough to apply Theorem 1.3.3 of [Mj as follows. The support of the 
equilibrium measure is S = [—6,6], the measure /i is supported by 5 1 , has a finite 
logarithmic energy and f_ b log -n^r + Q(z) is constant for z G 5, so /i is the 
equilibrium measure. To find /i a , it is enough to carry /i back on the circle by the 
inverse transformation 

1 1 + z 

i 1 — z 



5. Appendix 

5.1. Some properties of £ = logT and W = £'. From the Binet formula (Abramowitz 
and Stegun [I] or Erdelyi et al. [9] p. 21), we have for D\ex > 

1 f°° 

(5.1) £{x) = (x- ~)]ogx-x + l+ / /(s)[e~ s;E -e- s ]ds 

2 Jo 

(5.2) = (i-ijlogs-ar + i^TrJ + y 00 /^-"^, 
where the function / is defined by 
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and satisfies for every s > 0: 



(5.4) 



< f(s) < /(0) = 1/12 , 0< [sf(s) + 



1 



< 1 



By differentiation (recali that VP is the Digamma function T'/T), 



-i /"OO />00 

(5.5) logs = — + / sf(s)e- sx ds= e~ 
2x Jo Jo 

Moreover, since ty(x + 1) = i + ^{x), we have a variation of (|5.5[) : 



s /( s ) + -) rfs . 



(5.6) 



logx - *(cc + 1) = -— + / sf(s)e~ sx ds 
zx 



As easy consequences, we have, for every x > 0, 



(5.7) 
(5.8) 



< x (logs - i&(x)) < 1 , 
< logx-*(x)-±-<^ 



2x ~ 12x 

Differentiating again we see that for q > 1, $Hex > 0, 

= (-l) 9 - 1 ^ - iy-*- q + (-I)*- 1 e-* x si (sf(s) + 

and then 

(5.9) \^ q \x) - (-l) q -\q - l)lx- q \ < (mex)-"- 1 ^.. 

Another useful formula is 



ds 



k=l 



(5.10) 
fir z + 1 g M_. 

5.2. The density <?£ (5 ' ) and some moments related to it. Recall that for r > 
and 5 such that r + 29\zS + 1 > 0, the density on the unit disc D is given by 

ff W(«)= Cr ,«(i-H 3 r" 1 (i-«) I (i-2) tf 

where c r .a is the normalization constant. The following lemma is the key to compute 
c r ,s and the moments of g[ S \ 

Lemma 5.1. Let s,t,£ be complex numbers such that: 9Kt£, V\e(s + £ + l), $Re(i + 
I + 1) and Uie(s + t + £ + 1) are strictly positive. Then, the following identity holds: 



(5.11) / (1 - |z| 2 r -1 (l - z) s (l - zfd 2 z = nT 



£ , £ + 1 + s + 1 
• + 1 + s , £ + l + t 

where for the sake of simplicity we use the polygamma symbol 

r(o)r(6).- 



a,b,--- 
c, d, ■ ■ ■ 



T(c)T(d) ■ 
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A proof of this result is given in [5] . A first consequence is that 
(5.12) 



Cr.s = tt F 



r+1+5 , r + 1 + 5 
r , r +1+5+5 



A second consequence is that if 7 has the density gi j then we have 



E(i- 7 ) Q (i- 7 )" = r 

(5.13) 



r + l + 5 + S_ + a + b, r + l + S , r + 1 + 5 
r +1+5+6 } r + 1 +5 + a , r+l+5+b 



as soon as all the real parts of the arguments of the gamma functions are strictly 
positive. 

Let us notice that for r = the RHS of (|5.13[) is the Mellin-Fourier transform of 
1 — 7 when 7 g T is distributed according to X 1 -^. 

In this paper, we need the following computations, in order to deduce the mo- 
ments of log(l — 7). The quantities involved below are all well-defined as soon as 
s > so, where sq is some strictly negative quantity depending on r and 5, and in 
particular, for (s, t) in the neighborhood of (0,0), one can write: 



A(s,t) := logEexp(2s^elog(l -7) + 2i3mlog(l -7)) 

= logEcxp(JHe(2(s-if)log(l- 7 )) = 

= logE(l-7) s ~"(l-7) s+it = 

(5.14) = £(r + l + 5 + 5 + 2s) -£(r + 1 + 5 + 5) 

-I (r + 1 + 6 + s - it J - i(r + 1 + 5 + s + it 

+e(r+l + 5) +£(r + l + 5). 

To compute moments we need differentiation. First we have: 



— A(s,t) = 2* (r + 1 + 5 + 5+ 2s) 

OS 



(5.15) 



m H;t) 



(r + 1 + 5 + s + it) - * (r + 1 + 5 + s - it 
i\P ( r + 1 + 5 + s - it J -i^(r+l + 5 + s + it 



The first moment is then: 

E$Helog(l-7) = V(r + l + 5 + 5) - ^(r + 1 + 5) - -*(r + 1 + 5) 



E3mlog(l - 7) 



^(r + 1 + 5)- j^(r + 1 + 5) 



(5.16) Elog(l - 7) = *(r + 1 + 5 + 5) - #(r + 1 + s ) • 
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Differentiating again (|5.15[) we get 

d 2 

— Ms,t) = W(r + l + 5 + S + 2i 



*'(r + 1 + 5 + s + it ) - *'(r + 1 + S + s - it 



(5.17) ^A(s,«) = ^'(r+l + 5 + s-itj + ^' (r + I + 5 + s + it 

■q^qM s ^) = -i^'(r + l + 5 + s + itj +W'(r + l + 5 + s-it) 
and the second moments are 

VarfKelog(l -7) = *'(r + 1 + S + S) - 1 + 6) - ~#'(r + 1 + 5) 

(5.18) VarJmlog(l-7) = ^'(r + l + S) + ^Z>' (r + 1 + 5) 

Cov(JKelog(l-7),Jmlog(l -7)) = —V'(r + l + 6) - ~tf'(r + l + $). 

5.3. Complex logarithm and characteristic polynomial. Let Ef. be the set of 

the complex k x k matrices with no eigenvalue on the interval [l,oo). For V £ Ek, 
let us define 

k 

logdet(J fc -V) :=^log(l-A,), 

j'=i 

where the Aj's are the roots, counted with multiplicity, of the polynomial z 1— > 
det(zlk — V), and where in the right-hand side, one considers the principal branch 
of the logarithm. This definition is meaningful, since by assumption, 1 — \j ^ R_ 
for all j € {1, . . . , k}. By the continuity of the set of roots of a polynomial with 
respect to its coefficients, the set Ek is open and the function V i-> logdet(/fc — V) 
defined just above is continuous on Ek- In fact, since Ek is connected (this is 
easily checked by tridiagonalizing the matrices), this is the unique way to define 
the logarithm of det(/fc — V) as a continuous function of V € Ek if we assume that 
it should take the value zero at V = Q. 

Now, with the notation of the beginning of the paper, the matrix Gk{U n ) is a 
submatrix of the unitary matrix U n , and all its eigenvalues have modulus bounded 
by 1. If we assume 70, . . . , 7,1-1 7^ 1 (which holds almost surely under Cji then 
by (|1.2[) . $fe, rl (l) 7^ 0, and one easily deduces that Gk(U n ) € Ek, which allows to 
define log$fc )n (l) without ambiguity. Now, the map from ED™ -1 x (U\{1}) to M, 
given by 

fc-i 

( 7 i,...,7„_i) i-» ^log(l-7j) 

3=0 

is continuous if we take the principal branch of the logarithm, and since U n depends 
continuously on (7^ . . . ,7„_i) € D™ -1 x (U\{1}), as it can be checked in [5], the 
map 

(71 1 • • • ,7n-l) l-> log$fc,n(l) 
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is also continuous. These two maps have the same exponential, and one can check 
that they are both real if the 7/s are all real. Hence, they are equal, which fully 
justifies the equation 

fe-i 

(5-19) log$ Jk , w (l)=53log(l- 7i ). 

3=0 

5.4. Abel-Plana summation formula. 

Theorem 5.2. Let m < n be integers and let g be a holomorphic function on the 
strip {t € C,n < 9\et < m} (i.e. g is continuous on this strip and holomorphic in 
its interior). We assume that g{t) — o (exp(27r|3m t\)) as 3m t — > ±00, uniformly 
with respect to *He< G [n, m]. Then, 

j=m+l Jm 

g(m + iy) - g(n + iy) - g{m - iy) + g(n - iy) 
^731 d V- 

(5.20) 

For a proof see [20] p. 290. 
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